Question Answering Module Leveraging Heterogeneous Datasets

نویسندگان

چکیده

Question Answering has been a well-researched NLP area over recent years. It become necessary for users to be able query through the variety of information available - it structured or unstructured. In this paper, we propose module which a) can consume data formats heterogeneous pipeline, ingests from product manuals, technical forums, internal discussion groups, etc. b) addresses practical challenges faced in real-life situations by pointing exact segment manual chat threads solve user c) provides segments texts when deemed relevant, based on and business context. Our solution comprehensive detailed pipeline that is composed elaborate ingestion, parsing, indexing, querying modules. capable handling plethora sources such as text, images, tables, community flow charts. studies performed business-specific datasets represent necessity custom pipelines like proposed one several real-world document question-answering.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating Embedded Question Reuse in Question Answering

The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...

متن کامل

Open-Domain Question Answering on Heterogeneous Data

Open-domain question answering is a hot research topic in recent years. QA Track held by NIST has offered a new evaluation on this topic. However, its target is aimed at plain text collection only. This paper focuses on QA for heterogeneous data, including plain text, summaries, tabular data, and video programs. Discussion of what necessary adaptation could be done to deal with such kind of dat...

متن کامل

Leveraging Video Descriptions to Learn Video Question Answering

We propose a scalable approach to learn video-based question answering (QA): to answer a free-form natural language question about the contents of a video. Our approach automatically harvests a large number of videos and descriptions freely available online. Then, a large number of candidate QA pairs are automatically generated from descriptions rather than manually annotated. Next, we use thes...

متن کامل

Fora: Leveraging the Power of Internet Communities for Question Answering

This paper introduces a system for searching question answer pairs automatically extracted from the discussions in internet communities. The system, named Fora, aggregates discussions from multiple forums and newsgroups in the same domain, automatically extracts question answer pairs from the data, and provides searches of the question answer pairs. The system also offers expert search, query s...

متن کامل

Leveraging Community-Built Knowledge for Type Coercion in Question Answering

Watson, the winner of the Jeopardy! challenge, is a state-of-the-art open-domain Question Answering system that tackles the fundamental issue of answer typing by using a novel type coercion (TyCor) framework, where candidate answers are initially produced without considering type information, and subsequent stages check whether the candidate can be coerced into the expected answer type. In this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International journal on natural language computing

سال: 2021

ISSN: ['2278-1307', '2319-4111']

DOI: https://doi.org/10.5121/ijnlc.2021.10601